Minimum Sample Risk Methods for Language Modeling
نویسندگان
چکیده
This paper proposes a new discriminative training method, called minimum sample risk (MSR), of estimating parameters of language models for text input. While most existing discriminative training methods use a loss function that can be optimized easily but approaches only approximately to the objective of minimum error rate, MSR minimizes the training error directly using a heuristic training procedure. Evaluations on the task of Japanese text input show that MSR can handle a large number of features and training samples; it significantly outperforms a regular trigram model trained using maximum likelihood estimation, and it also outperforms the two widely applied discriminative methods, the boosting and the perceptron algorithms, by a small but statistically significant margin.
منابع مشابه
Minimum Sample Risk Methods for Language Modeling1
This paper proposes a new discriminative training method, called minimum sample risk (MSR), of estimating parameters of language models for text input. While most existing discriminative training methods use a loss function that can be optimized easily but approaches only approximately to the objective of minimum error rate, MSR minimizes the training error directly using a heuristic training p...
متن کاملPredictive Risk Mapping of Leptospirosis for North of Iran Using Pseudo-absences Data
Leptospirosis is a common zoonosis disease with a high prevalence in the world and is recognized as an important public health drawback in both developing and developed countries owing to epidemics and increasing prevalence. Because of the high diversity of hosts that are capable of carrying the causative agent, this disease has an expansive geographical reach. Various environmental and social ...
متن کاملA FUZZY MINIMUM RISK MODEL FOR THE RAILWAY TRANSPORTATION PLANNING PROBLEM
The railway transportation planning under the fuzzy environment is investigated in this paper. As a main result, a new modeling method, called minimum risk chance-constrained model, is presented based on the credibility measure. For the convenience ofs olving the mathematical model, the crisp equivalents ofc hance functions are analyzed under the condition that the involved fuzzy parameter...
متن کاملModeling Gold Volatility: Realized GARCH Approach
F orecasting the volatility of a financial asset has wide implications in finance. Conditional variance extracted from the GARCH framework could be a suitable proxy of financial asset volatility. Option pricing, portfolio optimization, and risk management are examples of implications of conditional variance forecasting. One of the most recent methods of volatility forecasting is Real...
متن کاملSystems Risk Analysis UsingHierarchical Modeling
A fresh look at the system analysis helped us in finding a new way of calculating the risks associated with the system. The author found that, due to the shortcomings of RPN, more researches needed to be done in this area to use RPNs as a new source of information for system risk analysis. It is the purpose of this article to investigate the fundamental concepts of failure modes and effects ana...
متن کامل